Anticipatory action selection for human-robot table tennis

نویسندگان

  • Zhikun Wang
  • Abdeslam Boularias
  • Katharina Mülling
  • Bernhard Schölkopf
  • Jan Peters
چکیده

Anticipation can enhance the capability of a robot in its interaction with humans, where the robot predicts the humans’ intention for selecting its own action. We present a novel framework of anticipatory action selection for human-robot interaction, which is capable to handle nonlinear and stochastic human behaviors such as table tennis strokes and allows the robot to choose the optimal action based on prediction of the human partner’s intention with uncertainty. The presented framework is generic and can be used in many human-robot interaction scenarios, for example, in navigation and human-robot co-manipulation. In this article, we conduct a case study on human-robot table tennis. Due to the limited amount of time for executing hitting movements, a robot usually needs to initiate its hitting movement before the opponent hits the ball, which requires the robot to be anticipatory based on visual observation of the opponent’s movement. Previous work on Intention-Driven Dynamics Models (IDDM) allowed the robot to predict the intended target of the opponent. In this article, we address the problem of action selection and optimal timing for initiating a chosen action by formulating the anticipatory action selection as a Partially Observable Markov Decision Process (POMDP), where the transition and observation are modeled by the IDDM framework. We present two approaches to anticipatory action selection based on the POMDP formulation, i.e., a model-free policy learning method based on Least-Squares Policy Iteration (LSPI) that employs the IDDM for belief updates, and a model-based Monte-Carlo Planning (MCP) method, which benefits from the transition and observation model by the IDDM. Experimental results using real data in a simulated environment show the importance of anticipatory action selection, and that POMDPs are suitable to formulate the anticipatory action selection problem by taking into account the uncertainties in prediction. We also show that existing algorithms for POMDPs, such as LSPI and MCP, can be applied to substantially improve the robot’s performance in its interaction with humans.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Intention Inference and Decision Making with Hierarchical Gaussian Process Dynamics Model

Anticipation is crucial for fluent human-robot interaction, which allows a robot to independently coordinate its actions with human beings in joint activities. An anticipatory robot relies on a predictive model of its human partners, and selects its own action according to the model’s predictions. Intention inference and decision making are key elements towards such anticipatory robots. In this...

متن کامل

The Communication Skills as Selection Criteria of Iranian National Table Tennis Coach: Sport Elites Perspectives

The purpose of this study was to determine the prioritization and comparison of communication skills as a selection criterion of Iran's national table tennis coach from the sport elites perspectives. As this was a descriptive study, survey methodology was employed. The study population consisted of 100 table tennis sport elites of whom 80 subjects were randomly selected using the Morgan table. ...

متن کامل

Title: Cost-Based Anticipatory Action Selection for Human-Robot Fluency

A crucial skill for fluent action meshing in human team activity is a learned and calculated selection of anticipatory actions. We believe that the same holds for robotic teammates, if they are to perform in a similarly fluent manner with their human counterparts. In this work we describe a model for human robot joint action, and propose an adaptive action selection mechanism for a robotic team...

متن کامل

Cost-Based Anticipatory Action Selection for Human-Robot Fluency

A crucial skill for fluent action meshing in human team activity is a learned and calculated selection of anticipatory actions. We believe that the same holds for robotic teammates, if they are to perform in a similarly fluent manner with their human counterparts. In this work we describe a model for human robot joint action, and propose an adaptive action selection mechanism for a robotic team...

متن کامل

Learning to Select and Generalize Striking Movements in Robot Table Tennis

Learning new motor tasks autonomously from interaction with a human being is an important goal for both robotics and machine learning. However, when moving beyond basic skills, most monolithic machine learning approaches fail to scale. In this paper, we take the task of learning table tennis as an example and present a new framework which allows a robot to learn cooperative table tennis from in...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Artif. Intell.

دوره 247  شماره 

صفحات  -

تاریخ انتشار 2017